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RECEIVER DEPENDENT SELECTION OF A WORST-CASE TIMING 
EVENT FOR STATIC TIMING ANALYSIS 

FIELD OF THE INVENTION 

[0001] The field of the present invention relates to electronic design automation, and 
5 more particularly, to methods and systems for timing analysis of electronic circuit blocks. 

BACKGROUND OF THE INVENTION 

[0002] Advances in silicon technology increasingly allow larger and more complex 

10 designs of electronic circuits to be formed on a single chip. For example, modern electronic 
designs have millions or tens of millions of transistors. At the same time, market demands push 
circuit designers to create these designs rapidly and efficiently. A recent trend to increase the 
speed and efficiency of the design process involves the re-use, or recycling, of electronic circuit 
blocks or subsystems, commonly referred to as cores, Intellectual Properties (IPs), or virtual 

15 component blocks (VCs). Once the design for a virtual component block has been tested and 
verified, it can be re-used in other applications that may be completely distinct from the 
application which led to its original creation. 

[0003] For example, a subsystem for a cellular phone application specific integrated 
circuit (ASIC) may contain a micro-controller, a digital signal processor, and other electronic 

20 components. After the design for the cellular phone subsystem has been tested and verified, it 
could be re-used as a virtual component block in a circuit design for an automotive application. 
The reuse of virtual component blocks allows a designer to complete the design process much 
faster than building the entire design from scratch, and avoids the need for debugging, testing 
and verification of the subsystems embodied in the virtual component block. 

25 [0004] While virtual components have been found to be convenient for expediting and 

simplifying the circuit design process, the successful use of virtual component blocks hinges on 
the ability of the designer to accurately characterize their timing and functionality. A number of 
techniques have been proposed and developed for performing timing analyses on virtual 
component blocks, among which static timing analysis (STA) is most widely used. 
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[0005] Static Timing Analysis is used in the process of verifying the timing correctness 
of a digital circuit design during one clock cycle, without the need for simulating the circuit. 
During the STA process, a worst-case structural (or topological) delay between a circuit's inputs 
and outputs is calculated. For example, a model of a signal that propagates through 
5 combinational logic includes an analysis of the longest and the shortest paths spanning between a 
launching register and a capturing register in order to determine, in the worst case, whether the 
signal arrives at the capturing register during the active pulse of the clock cycle. 

[0006] In STA, a rising or falling voltage transition is abstracted by a timing event 
(TE), to approximate the transition of the actual waveform using two of its parameters, the 

10 arrival time and slew rate. The arrival time of the transition is based on the time that the voltage 
of the waveform reached a user-selected reference voltage (V re f), or trip point, such as 0.5 Vdd 
for example. The slew rate is determined by an amount of time for the waveform to move from 
one given voltage level, V^i, to a second given voltage, V t h2- For example, the slew rate may be 
based on the amount of time that the waveform takes to transition from a voltage of 0.2Vdd to 

1 5 0.8 Vdd. However, given the complexity of modern designs, the approximation of the waveform 
provided by the TE may be insufficient to verify the design's timing correctness. 

[0007] For example, in a STA process, a design is represented by an acyclic directed 
graph, called a Timing Graph (TG), where timing nodes represent intermediate in the design and 
edges represent delays along nets and library cells. The TG is built using an assumption that the 

20 delay through a given stage is dependent only on the waveform that is received by the stage, and 
on the interconnects of the stage. During the analysis, TEs are propagated forward in the TG 
from input nodes to output nodes. If a stage has multiple inputs, the corresponding node in the 
TG receives multiple TEs. A worst-case TE is selected from the multiple TEs at the timing node 
for further propagation to downstream logic elements during the analysis. 

25 [0008] In this conventional approach, only the arrival time of the timing eveni at the 

given node is used as the criterion for selecting the worst-case timing event, and the slew rate is 
typically not considered. However, ignoring the slew rate can lead to optimistic results for the 
critical path delay that is verified during this process, especially if a skewed gate (a gate with a 
trip point away from 0.5 Vdd) is used. For example, a TE may be selected as the worst-case 

30 according to the conventional approach, which is based on the arrival time of the TE on the 
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particular node. However, this "worst-case" TE may not be the worst-case TE if the TEs are 
propagated to the output of a skewed receiver, because the slew differences of the signals may 
have a strong impact on the delay over the receiving stage. 

[0009] To prevent optimistic results, conventional tools [CTEref] allow a user to 
5 construct a new timing event by combining the worst arrival time, (such as the maximum time 
for a max delay analysis or the minimum time for min delay analysis) and the worst slew rate 
(such as the lowest or highest slew rate for, respectively, max or min delay analysis). The arrival 
time and slew rate of the new "worst-case" timing event is therefore a combination of various 
parts of different timing events, rather than a selection of one of the TEs that arrives at the 
10 receiving gate. While this approach is usually conservative, it may lead to optimistic results if 
the receiving gate is skewed. For example, if an inverter is skewed towards a low voltage (e.g. 
with similar sizes of p- and n- devices), a rising transition with a lower slew rate can result in a 
lower delay over the inverter. 

[0010] Furthermore, the conventional analysis does not consider properties of the 
1 5 receiving gate itself when determining the worst-case transition. This causes results based on 
conventional criteria to be either optimistic or very conservative, and may lead to functional 
failures or over design. 

[0011] Therefore, more accurate timing verification methods are needed. [ICC AD ref]. 
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SUMMARY OF THE INVENTION 

[0012] A method for determining a worst-case timing event is disclosed. The method 
includes determining a plurality of output arrival times and slew rates for the plurality of input 
timing events based on the timing model and load of receiving gate, and selecting a worst-case 
5 input timing event from the plurality of input timing events based on the arrival times computed 
on the output of the receiving gate. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0013] Figure 1 shows an example of a circuit design partitioned into stages and into a 
timing graph to perform receiver-dependent static timing analysis, 

[0014] Figure 2 shows a schematic example of a worst-case transition selection using 
5 receiver-dependent static timing analysis. 

[0015] Figure 3 shows an example of a circuit for which conventional selection 
criterion leads to optimistic results, and receiver dependent analysis leads to more accurate 
results. 

[0016] Figure 4 shows simulation results of transitions applied to the circuit in Figure 
10 3, including a worst-case transition selected by the receiver dependent method. 

[0017] Figure 5 shows an example of a stage of a complex gate with pull-down stacks 
having unbalanced strengths, causing different output slew rates which are analyzed by the 
receiver dependent worst-case selection method. 

[0018] Figure 6 shows examples of simulation results of transitions applied to the stage 
15 shown in Figure 5. 

[0019] Figure 7 shows results of circuit simulations of the circuit shown in Figure 3 
having an upsized driver to improve the input slew of the circuit, which is considered during the 
receiver dependent selection method. 

[0020] Figure 8 shows example of a circuit for which logic relations between 
20 neighboring nets can lead to different slews and arrival times on the input of receiver if effects of 
noise-on-delay are considered in the receiver dependent STA. 
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DETAILED DESCRIPTION 

[0021] A method of static timing analysis selects a worst-case timing event, or 
transition, based on characteristics of the receiving gate, during the design and verification 
5 process of an electronic circuit, so that the circuit can be fabricated onto a VLSI chip. Because 
the receiver-dependent process selects the worst-case timing event at the input of the receiving 
gate based on the receiving gate's characteristics, such as the gate's delay for example, the 
receiver-dependent STA results are more accurate than methods that only consider the input 
timing event's features. 

10 [0022] The receiver-dependent STA selects the worst-case timing event from multiple 

timing events that are sent by one or more driving gates. The selected worst-case timing event is 
then propagated through the receiving gate to the downstream logic to determine the delays of 
the circuit design. Calculated delays include the largest and smallest delays along various 
sensitizable paths that connect sampling blocks (such as flip-flops or latches), or primary inputs 

15 and outputs of the circuit, in the direction of the data flow, for example. 

[0023] Figure 1 shows an example of a circuit design 100 that is partitioned into stages 
x, y, z, and p to generate the TG 1 10. The nodes x, y, z, and p of the TG correspond to inputs of 
the gates where timing events are determined. The timing arcs 102, 103, 104 of the circuit 
represent sensitizations, or delays, encountered by signals as they propagate through the gates 

20 105 and 106. These sensitizations are represented by the timing events 1 12, 1 13, 1 14 of the TG 
110. 

[0024] During the analysis, the TG 1 10 is traversed in topological order from the input 
nodes (sources) x and y to the output node (sink) p corresponding, respectively, to primary inputs 
and outputs of the circuit 100. The multiple sensitizations (timing arcs) 102, 103 over the 
25 NAND gate 105 of circuit 100 leads to multiple timing events 1 12, 1 13 that are received by node 
z of TG 1 10. The timing event with the worst-case transition is selected to be propagated 
through the input of inverter 106 at node z and is output by the inverter at node p. This worst- 
case selection process is based on characteristics of the inverter. 

[0025] Figure 2 shows that the two transitions, 112 and 113, arriving at node z, have 
30 different arrival times and slews: the transition with the later arrival time, 1 12, has a smaller slew 



PA:521 15838.1/2019535-2195357097 



6 



Express Mail Label No. EV348160526US 



Patent 
CA7017522001 



than the earlier transition 113. These different slews and arrival times of the transitions arriving 
at the input of the inverter are used, along with the receiving inverter's characteristics, in 
selecting the worst-case timing event 1 13, which is schematically shown in Figure 2. For 
example, the selection method can consider the effects of the inverter on the transitions from the 
5 arrival times and slews of the transitions 1 12' and 1 13' at the receiver's output to identify the 
worst-case input transition, 113. The receiver dependent selection process is therefore more 
accurate than a conventional selection method, which chooses the event with the later arrival 
time at the gate's input, max {T it n}- 

[0026] For example, considering the input transitions as shown in Figure 2, input 

10 transition 1 12, which has the later arrival time, is chosen as the worst-case transition for further 
propagation by a conventional method. But when both signals are propagated to the output of 
the inverter, the transition 1 12 is not the worst-case, because the inverter's characteristics affect 
the output times of the transitions 1 12' and 1 13' As shown in Figure 2, the output delay for the 
signal 1 13 that travels along x to p is longer than the output delay of the signal 1 12 that travels 

1 5 along path y to p. Therefore, the signal 1 12 is selected as having the worst-case transition using 
the receiver dependent selection method. 

[0027] The output time, T 0 , of a transition on the output node OUT may be computed 

based on the transition's arrival time at the input of the inverter, Tj, and the delay over the gate to 

the output, D g : 
20 T 0 = T{ + D g (1) 

where the times T 0 and T( of the output and input transitions are selected by a user to be the time 

each reaches its crossing time V re f. 

[0028] The delay D g of a gate depends in part upon the slew of the input transition, and 
the capacitive load at the gate's output. The gate delay, D g , can therefore be represented as an 
25 algebraic function of the input slew, S i9 and the gate's total capacitive load, C [k-factor_ref]): 

D g = F(S if C) (2) 

[0029] Similarly, the output slew can be represented as a function of input slew and 
capacitive load: 

So = Q(S if C) (3) 
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[0030] For N timing events propagated to an input of a receiving gate, T i n , £,> denote, 
respectively, arrival times and slews of an n-th timing event at the input of the receiver, for 
n-l..N, where N is number of the timing events. The corresponding output transition is then 
denoted as {T 0>n , S 0 , n }. These values are calculated by introducing input slews, S itft , and the total 
5 load capacitance C of the net driven by the receiver, into the gate delay model described by 
equations (2) and (3), such that: 

D gtn =F(S itnf C) (4) 

S 0 ,n = Q(S Un Q (5) 

[0031] The values of D g>n and S 0 , n may then be calculated based on given values for S i>n 

10 and C. The input slew and total capacitance values may be estimated using a timing model of 
the receiving gate. However, in many cases, especially for modern processes, wires have non- 
negligible resistance. Therefore, using the total capacitance as the load C of a gate when 
calculating the gate delay and the output slew may introduce inaccuracy. In such cases, an 
effective capacitance may be used in equations (4) and (5) instead of the total capacitance. The 

15 effective capacitance, C e ff, of the load may be determined with an iterative method, and used to 
more precisely approximate the actual load in terms of output delay [Pileggi]. The iterative 
method may be performed using an interconnect simplification, and may involve expansive 
numerical computations. Therefore, if the nets have a large resistance, one can use the iterative 
method of determining C e ff, and store the parameters of the simplified net for later use. 

20 [0032] In general, using C to t as the receiver load in Eq.(2) can provide the correct 

ordering of output timing events according to their corresponding arrival times. For example, the 
delay is usually a monotonic function of output load, and C e ^is usually close to the total 
(lumped) capacitance C m of the driven net. Therefore, C tot instead of C e g may usually be used in 
equations (4) and (5) to select the worst-case timing event. This avoids a need in expensive 

25 processing of the output net and C e jf iterations. 

[0033] The values C tot and Si for a given gate, which are used to calculate the output 
delays and output slews, may be available from a library of timing models, such as LIBERTY 
[lib_ref] or TLF[tlf_ref]. For example, the library may store a delay model of a gate, including 
the input slew and capacitive load data of the gate, in a table of a library, as shown in Table 1. 
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Si \ 


lOfF 


30ff 


lOOfF 


300ff 


20ps 


30ps 


40ps 


80ps 


200ps 


50ps 


50ps 


70ps 


lOOps 


250ps 


200ps 


80ps 


120ps 


180ps 


350ps 


500ps 


150ps 


200ps 


260ps 


450ps 



Table 1: Delay table for gate as a function of input slew and output load 
The columns of Table 1 are headed by capacitive load data, and the rows are headed by input 
5 slew data. The data for the capacitive load and input slew values shown in Table 1 may be 
determined from the results of a number of circuit simulations. For example, the slew of the 
input transition may be determined by the difference between simulated crossing times of 
selected threshold voltages, Vm and V t h2, of the transition. Also, when using the tabular format, 
an appropriate interpolation between available values may be performed during the analysis. 

10 [0034] The delay and slew data may be also fitted using cubic splines having 

coefficients that are also stored in tables in a library. For example, the coefficients stored in 
Tables 2 and 3 are used in TLF expressions to determine gate delay and output slew values. 
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Tables 2 and 3: Examples of spline representation of data in TLF format 



Cell(ssad2 

TIMINGJVlodel(ioDelayRise ModelO 
timingJ)y_trans_and_capOMod 
(spline 
data 
( 

(0.202000 0.273000 0.517000 1.496000) 
(0.272000 0.344000 0.641000 1.566000) 
(0.325000 0.396000 0.691000 1.614000) 
(0.415000 0.489000 0.794000 1.717000) 

) 

) 

) 

Table 3 

TLF Load Table 

TMINGJVlodel(SIopeRiseModel0 
timingJbyJrans_and_capJ)Mod 
(Spline 
data 

( 

(0.173000 0.316000 0.983000 3.090000) 
(0.179000 0.324000 0.986000 3.090000) 
(0.189000 0.333000 0.991000 3.092000) 
(0.219000 0.366000 1.020000 3.106000) 

) 

) 

) 



[0035] After determining the gate delay and output slew by introducing the input slew 
and load capacitance values to equations (4) and (5), the arrival times of the transitions at the 
receiver output are then founded as: 

T 0f n = T iin +D n (6) 

[0036] The worst-case timing event selected for the further propagation in the max 
delay analysis is that having latest arrival time: 
max{T 0 J, forn=/...# (7) 

[0037] For the min delay analysis one needs to choose a minimum of arrival times 
instead of maximum. 

[0038] To illustrate the receiver dependent selection process, and to quantify the 
improvement over conventional methods, consider an example of the stage shown in Figure 3. 
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The driver is an inverter from a O.13u-technology industrial library skewed towards the low 
voltage, meaning that n-device is stronger than the p-device. Results of simulations of the circuit 
for two different transitions at the input are shown in Figure 4. The two input transitions are 
linear saturated ramps of 20 ps and 120 ps, respectively, and arrival times differ by 30 ps (as 
5 defined by using Vref equal to 50% of Vdd). 

[0039] The two transitions on the output node N of the skewed inverter have close 
slews, but as is shown in Figure 4, the input transition with the worse input slew, which is the 
linear ramp marked with rectangular boxes, has the output transition with the earlier arrival time, 
which is the transition curve marked with circles. The transition with the later input arrival time, 
1 0 which is the ramp marked with diagonal crosses, has a better input slew and a later output arrival 
time, as shown by the curve marked with horizontal and vertical crosses. If a conventional 
selection criterion is used, then the input with the sharper input slew (the ramp with the diagonal 
' crosses) is chosen as the worst-case transition, and leads to a -20% underestimation of delay on 
this stage. If the worse input slew is combined with worse output arrival time to form a new 
1 5 timing event to be propagated further, then a 25% overestimation (conservative) on this stage 
would occur. The receiver dependent worst-case selection method described is therefore more 
efficient in selecting the worst-case transition from a plurality of transitions with different arrival 
. times and slews, which in this example is the ramp with the better input slew (the ramp with 
diagonal crosses). 

20 [0040] Practical examples of situations where the receiver dependent method improves 

the simulation results include converging paths, unbalanced stacks, critical path fixing and 
optimization, and noise-on-delay effects. 

[0041] Converging paths . Timing events with different arrival times and slews can 
emerge due to converging paths. For example, most gates of circuit designs have more that one 

25 input. If two or more inputs of a gate receive transitions with very different slews, it is probable 
that the output transitions will also have different slews. This may cause an inversion of the 
arrival times of the transitions on subsequent stages, especially if the stages are skewed. 

[0042] Unbalanced stacks . Unbalanced strength of device stacks in complex gates 
can also lead to significant differences in slews of the output transitions. An example of such a 

30 circuit is shown in Figure 5. The circuit has two gates, a driver-complex gate NOT(A0*Al+B), 
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and a receiver-inverter. The complex gate NOT(AO*A1+B) has different strengths of pull-down 
stacks corresponding to vectors with switching inputs B and Al, respectively. The different 
strengths of the two vectors cause the two rising transitions B and Al at the node N to have 
different output slews at the node OUT, and as a result, the earlier/later arrival times of the 
5 transitions are inverted at the receiver's output. 

[0043] For example, the simulation results of two transitions on inputs AO and B of the 
circuit of Figure 5 are shown in Figure 6, by the two saturated ramps . The input transition to 
AO is shown in the ramp marked with diamond boxes, and the input transition to B is shown by 
the ramp having a solid line. The output transitions at the output of the driver have different 

10 slews and arrival times: transition B has the faster input slew, but has an output transition that 
arrives later than the output corresponding to input transition AO, as shown by the output 
transition for the input to B, which is marked with circles, as compared with the output transition 
for the input to AO, which is marked with crosses. As a result, the responses on the output of 
inverter arrive in an order that is the opposite of the order of their input slews. For example, the 

1 5 transitions at the output node OUT, show that the transition caused by the input to node B (the 
curve marked with rectangular boxes) arrives later than the transition caused by the input to node 
AO (the curve marked with diagonal boxes). Therefore, the input signal which arrives earlier on 
nodes Al and N has the worst-case transition, and is selected to be propagated on the inverter 
output, OUT. 

20 [0044] Critical path fixing and circuit optimization . During a fixing stage of the 

design chain, gates belonging to a critical path are often upsized. This affects (mostly improves) 
both the arrival time and slew of a timing event on the input of the next gate. Although the 
arrival time is the target of the stage optimization, the improved slew may, depending on the 
skew property of the receiving gate, have a significant impact on the arrival time on the next 

25 stage. The receiver-dependent selection method is therefore more accurate than tools using the 
conventional worst-case event selection method, because they usually neglect this slew 
improvement, which may lead to usage of a sub-optimal cell. 

[0045] For example, Figure 7 shows simulation results for two inverters of different 
sizes, each having the circuit design of Figure 3. The simulation results are obtained by 

30 applying the same input transition to the different drivers, which is represented by the linear 
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saturated ramp of 50ps, which is marked with rectangular boxes. The original inverter has sizes 
p/n =1 .92u/l . 12u. The upsized driver is a stronger inverter having sizes p/n=3.84u/2.52u. The 
output of the smaller driver to node N is shown by the curve marked with crosses. The output 
transition of the signal from the smaller driver to node OUT is shown by the curve marked with 
5 diamonds. The output of the bigger driver on node N is shown by the curve marked with circles, 
and the output transition at node OUT is shown by the curve marked with triangles. 

[0046J As seen in Figure 7, the delay on the receiver's input improves by -1 lps when 
the driver's size is increased. Conventional analysis criterion leads to an over-design of the 
circuit, because the improved slew on the output of the inverter reduces the delay. The receiver 

10 dependent selection process leads to a more efficient design, because the effects of the improved 
slew rate can be considered based on measurements at the output of receiver. The simulation 
result of upsizing on the delay that is determined with the receiver dependent selection method 
therefore becomes significantly more accurate, at ~17ps. 

[0047] Noise-on-delay effect . Several tools provide the ability to account for coupling 

15 noise effects on delay [CTE/celtic]. The noise effects, such as the logic correlation between 
victim and aggressor nets, can lead to different slews and arrival times for the transitions at the 
input and output of the victim net. An example of such a circuit shown in Figure 8. The 
aggressor net B causes a push-out and increase of slew of the transition on the victim net N when 
both are transitioning in opposite directions, such as when the falling transition on N is caused by 

20 rising transition on B. However, the net B should not be included in the list of active (switching) 
aggressors when the falling transition on N is caused by rising transition on A, since this would 
violate the sensitization. Therefore, in the case of significant coupling between N and 5, the 
difference in slews may be accounted for using the receiver dependent method. 

[0048] The receiver-dependent static timing analysis provides a method, which 

25 indicates how to choose a signal to be propagated to the next gate in the design. The method 
includes determining an output slews and arrival times for each signal that is input to a gate, 
based on a timing model of the gate, and selecting a worst delay input signal from the input 
signals based on the output slews or delays, or both. For example, the signal with the worst 
output delay as a function of input slew may be selected. Also, the signal with the worst output 

30 slew as a function of input slew may be selected. The timing model of the receiving gate is used 
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to find which of the timing events will cause the worst-case arrival time of a signal at a sampling 
element on the path. The disclosed method therefore provides more accurate and conservative 
selection criteria for identifying the worst-case timing. It also improves quality of circuit 
optimization for example helping to minimize over-design of driver upsizing. 
5 [0049] The timing analysis method may be performed by an automated process, where 

a computer readable medium stores a computer program that has instructions for performing 
receiver-dependent static timing analysis. A computer or processor that receives the computer 
instructions from the computer memory device then executes the instructions to perform the 
analysis. In the automated process, the circuit design to be analyzed may include a virtual circuit 

1 0 block with enough information to calculate the output timing delays. The circuit design may, for 
example, be in a netlist format, with the gates referred to in the netlist being pre-characterized to 
the extent necessary to perform the timing analysis. Information about timing (i.e., propagation) 
delays across the gates in the netlist may be stored in a timing library format (TLF) expression or 
in any other suitable format. 

1 5 [0050] These and other embodiments of the present invention may be realized in 

accordance with the above teachings and it should be evident that various modifications and 
changes may be made to the above described embodiments without departing from the broader 
spirit and scope of the invention. The specification and drawings are, accordingly, to be 
regarded in an illustrative rather than restrictive sense and the invention measured only in terms 

20 of the claims. 
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